Abstract. This is the fifth of our series of works about the shape parameter. We now explore
the parameter $\beta$ contained in the famous gaussian function $e^{-\beta|x|^2}$, $x \in R^n$. In the theory of radial
basis functions (RBF), the gaussian is frequently used by virtue of its good error bound and numerical
tractability. However, the optimal choice of $\beta$ has been unknown. RBF people only know that $\beta$ is
very influential, but do not have a reliable criterion for its choice. The purpose of this paper is to
uncover its mystery.
1 Introduction

Let

$$h(x) := e^{-\beta|x|^2}, \quad x \in R^n,\ \beta > 0. \qquad (1)$$

For any given data points $(x_j, y_j)$, $j = 1, \ldots, N$, where $X = \{x_1, \ldots, x_N\}$ is a subset of $R^n$ and the
$y_j$'s are real or complex numbers, it is well known that there is a so-called $h$ spline interpolant $s(x)$
of these data points, defined by

$$s(x) = p(x) + \sum_{j=1}^{N} c_j h(x - x_j) \qquad (2)$$

where $p(x)$ is a polynomial in $P_{m-1}$, $m$ being the order of the conditional positive definiteness of $h$,
and the $c_j$'s are chosen so that

$$\sum_{j=1}^{N} c_j q(x_j) = 0 \qquad (3)$$

for all polynomials $q$ in $P_{m-1}$ and

$$p(x_i) + \sum_{j=1}^{N} c_j h(x_i - x_j) = y_i, \quad i = 1, \ldots, N. \qquad (4)$$

Here $P_{m-1}$ denotes the class of polynomials in $R^n$ of degree $\leq m - 1$, and $P_{m-1} := \{0\}$ if $m = 0$.
The linear system of equations (3) and (4) has a unique solution $(c_1, \ldots, c_N)$ with $p(x) = 0$ since
$m = 0$ for (1). Further details can be found in [12].
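For the gaussian, where $m = 0$ and the polynomial part vanishes, the system (3)-(4) reduces to a single symmetric linear system for the coefficients $c_j$. The following is only a minimal sketch of that reduction, not code from the paper: the function names, the sample data and the value of `beta` are illustrative assumptions, and a small hand-written elimination routine stands in for whatever solver one would use in practice.

```python
import math


def gaussian(r2, beta):
    """h evaluated from the squared distance r2 = |x|^2, i.e. exp(-beta * r2)."""
    return math.exp(-beta * r2)


def sq_dist(u, v):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit(centers, values, beta):
    """Coefficients c_j solving sum_j c_j h(x_i - x_j) = y_i (equation (4), p = 0)."""
    a = [[gaussian(sq_dist(xi, xj), beta) for xj in centers] for xi in centers]
    return solve_linear(a, values)


def evaluate(x, centers, coeffs, beta):
    """The interpolant s(x) = sum_j c_j h(x - x_j) (equation (2), p = 0)."""
    return sum(c * gaussian(sq_dist(x, xj), beta) for c, xj in zip(coeffs, centers))


# Illustrative data only: three centers on the real line.
centers = [(0.0,), (0.5,), (1.0,)]
values = [1.0, 0.0, 2.0]
beta = 4.0
coeffs = fit(centers, values, beta)
```

Calling `evaluate(x, centers, coeffs, beta)` at any of the centers should reproduce the corresponding data value up to rounding. For small $\beta$ the matrix becomes badly conditioned, so a sketch like this is only meant to show the structure of the system.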

We use $s(x)$ as the approximating function. For error estimates there are two kinds of error
bound, the algebraic type [13] and the exponential type [14] and [6]. The latter is an improved form
of the former. In this paper we will show that if the shape parameter $\beta$ is chosen in an appropriate
way, the exponential-type error bound will be minimal.



2 Fundamental Theory 

In the theory of RBF, any conditionally positive definite (c.p.d.) function $h$ of order $m$ induces a
function space called the native space, denoted by $C_{h,m}$. Its definition and characterization can be
found in [3J, [3], [TJ, [T2], [T3J and [IB]. We adopt Madych and Nelson's definition which is quite 
different from that of Wu and SchabackJTTJ. Besides the native space, each / € Ch, m has a seminorm 
$\|f\|_h$ which plays an important role in our theory. In this paper $h$ is just the function in (1) and
$m = 0$.

There are two main theorems in this paper. Before introducing the first of our main theorems, 
we need a basic definition. 

Definition 2.1 For any positive integer $n$, the number $\gamma_n$ is defined by $\gamma_1 = 2$ and $\gamma_n = 2n(1 + \gamma_{n-1})$
if $n \geq 2$.
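The recursion in Definition 2.1 grows quickly with the dimension, which matters later because $\gamma_n$ enters the constants of the error bound. A minimal sketch, assuming the recursion applies for every $n \geq 2$ (the function name `gamma_number` is ours, not the paper's):

```python
def gamma_number(n):
    """gamma_1 = 2 and gamma_n = 2 n (1 + gamma_{n-1}) for n >= 2."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    value = 2
    for k in range(2, n + 1):
        value = 2 * k * (1 + value)
    return value


# First few values: gamma_1, gamma_2, gamma_3, gamma_4.
first_values = [gamma_number(n) for n in range(1, 5)]
```

Under this reading the first values are 2, 12, 78 and 632.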

As pointed out in Section 1, the theoretical cornerstone of our approach is the exponential-type error
bound constructed by Madych and Nelson. However, some crucial constants in the error bound had
been unknown, and determining them was considered a hard question. Fortunately these constants are thoroughly
clarified in [5]. In [5] the author presents a complete and lucid exponential-type error bound for
gaussian interpolation. In order to develop useful criteria for the optimal choice of $\beta$, we need the
following theorem, which is cited directly from [5].

Theorem 2.2 Let $h(x) = e^{-\beta|x|^2}$, $\beta > 0$, be the gaussian function in $R^n$. Then, given a positive
number $b_0$, there are positive constants $\delta_0$, $c$, and $C$ for which the following is true: If $f \in C_{h,m}$ and
$s$ is the $h$ spline that interpolates $f$ on a subset $X$ of $R^n$, then

$$|f(x) - s(x)| \leq \Delta''(C\delta)^{c/\delta} \cdot \|f\|_h \qquad (5)$$

n-2 n-1 

, where A" = tt^~ ■ (n ■ a n )% ■ 2~z~ ■ (^^-^j 4 for even n and A" = 7rT • (n ■ a n ) 5 • (^^^j * for odd 

$n$, holds for all $x$ in a cube $E$ provided that (a) $E$ has side $b$ and $b \geq b_0$, (b) $0 < \delta \leq \delta_0$, and (c) every
subcube of $E$ of side $\delta$ contains a point of $X$. Here, $\alpha_n$ denotes the volume of the unit ball in $R^n$.
The number c is equal to ^p- where j n was defined in Definition2.1. The number C is equal to 

(3* • e • \J2pfi^fn ■ e 2 "f") 4 ■ b\ ■ j n , where p = g • Moreover, S can be defined by 

r 1 1 

Sq = min 



, where 



(33 -e-y/TpP- V^-e 2 "T") 4 -&o-7n 

t, — r 1 — tt , if n is odd and n > 1 , 

27 (n-2) 1 n * s even an d n> 2, 
" ifn = 2. 
Note that the error bound (5) tends to zero as $\delta$ tends to zero. This is the key to understanding
this seemingly complicated theorem. In (5) the constant $C$ depends strongly on $\beta$. It is tempting to
think that in (5) only $C$ is influenced by $\beta$. In fact, $\|f\|_h$ also changes as $\beta$ changes. The change of
$\|f\|_h$ cannot be seen in a transparent way. Therefore Theorem 2.2 cannot be used directly to find the
optimal $\beta$.

In order to overcome this problem, we introduce two function spaces as follows.
Definition 2.3 For any $\sigma > 0$, the class of band-limited functions in $L^2(R^n)$ is

$$B_\sigma := \{ f \in L^2(R^n) : \hat{f}(\xi) = 0 \ \text{if}\ |\xi| > \sigma \},$$
where $\hat{f}$ denotes the Fourier transform of $f$.
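Definition 2.4 For any $\sigma > 0$,

$$G_\sigma := \left\{ f \in L^2(R^n) : \int |\hat{f}(\xi)|^2 e^{|\xi|^3/\sigma}\, d\xi < \infty \right\}$$

where $\hat{f}$ denotes the Fourier transform of $f$. For each $f \in G_\sigma$,

$$\|f\|_{G_\sigma} := \left\{ \int |\hat{f}(\xi)|^2 e^{|\xi|^3/\sigma}\, d\xi \right\}^{1/2}.$$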
Let's investigate $B_\sigma$ first. For each $f \in B_\sigma$, by Corollary 3.3 of [13],



{2Tt) 2n h{d 



i 



< 



(2tt)" 
1 



1/2 

|/(0| 2 e^-^j 

1/2 



Wr eV \l 1/(1)1 * 

Substituting this result into (5), we obtain the following useful theorem.

Theorem 2.5 For any $\sigma > 0$, $f \in B_\sigma$ implies $f \in C_{h,m}$ and (5) can be transformed into

\f{x) - s(x)\ < A"(C5)f • (27T)"" • ew ■ \\f\\ LW (6) 
, where h is defined as in (1). 

Functions in $G_\sigma$ can be treated in a similar way. For any $f \in G_\sigma$, we have

1 f f , ^ ,o L£J 



(2tt 



1/2 



1/2 



1 Kii_LLL , 



This gives the following theorem. 
Theorem 2.6 For any $\sigma > 0$, $f \in G_\sigma$ implies $f \in C_{h,m}$ and (5) can be transformed into

|/(a;) - s(ar)| < A"(C«5)f (2tt)-" j sup e^^" I ||/|| G(7 (7) 
, where h is defined as in (1). 
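The supremum appearing in the bound for $G_\sigma$ can be evaluated in closed form once the weights are fixed. The sketch below rests on two assumptions that are ours: that the weight defining $G_\sigma$ is $e^{|\xi|^3/\sigma}$, and that the gaussian contributes a factor proportional to $e^{|\xi|^2/(4\beta)}$ through the reciprocal of its Fourier transform. Under these assumptions one maximizes $g(r) = r^2/(4\beta) - r^3/\sigma$ over $r = |\xi| \geq 0$; setting $g'(r) = 0$ gives $r^* = \sigma/(6\beta)$ and $g(r^*) = \sigma^2/(432\beta^3)$. The code returns the logarithm of the supremum, since the supremum itself overflows for small $\beta$, and compares it with a brute-force grid search.

```python
def log_sup_weight(beta, sigma):
    """log of sup over r >= 0 of exp(r**2 / (4*beta) - r**3 / sigma).

    Assumed form of the exponent; the maximizer is r = sigma / (6 * beta).
    """
    return sigma ** 2 / (432.0 * beta ** 3)


def log_sup_weight_grid(beta, sigma, r_max, steps=200000):
    """Brute-force check: maximize the same exponent on a uniform grid in [0, r_max]."""
    best = 0.0  # the exponent equals 0 at r = 0
    for i in range(steps + 1):
        r = r_max * i / steps
        best = max(best, r * r / (4.0 * beta) - r ** 3 / sigma)
    return best


# Illustrative parameter values only.
closed_form = log_sup_weight(0.5, 1.0)
grid_value = log_sup_weight_grid(0.5, 1.0, r_max=2.0)
```

The exponent grows like $\beta^{-3}$: for $\sigma = 1$ and $\beta = 0.003$ it is already of order $10^4$, which is one reason to work with logarithms when evaluating such expressions numerically.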

There is an improved exponential-type error bound from which a set of criteria for the choice of the 
shape parameter can also be developed. In this kind of error bound the data points are not purely 
scattered and the interpolation happens in an n-simplex. 

The definition of an n-simplex in $R^n$ can be found in [2]. A 1-simplex is a line segment, a 2-simplex is
a triangle, and a 3-simplex is a tetrahedron.

Let $T_n$ be an n-simplex in $R^n$ with vertices $v_1, \ldots, v_{n+1}$. Any $x \in T_n$ can be written as a convex
combination of the vertices:

$$x = \sum_{i=1}^{n+1} c_i v_i$$

where $\sum_{i=1}^{n+1} c_i = 1$ and $c_i \geq 0$ for all $i$. We call $(c_1, \ldots, c_{n+1})$ the barycentric coordinate of $x$. Let's
define 'evenly spaced' points of degree $l$ to be those points whose barycentric coordinates are of
the form

$$\left( \frac{k_1}{l}, \frac{k_2}{l}, \ldots, \frac{k_{n+1}}{l} \right), \quad k_i \ \text{nonnegative integers with} \ \sum_{i=1}^{n+1} k_i = l.$$

It's easily seen that the number of such points in $T_n$ is exactly $N = \dim P_l^n$. Also, as stated in
[1], evenly spaced points form a determining set for $P_l^n$, the space of polynomials of degree $l$ in $n$
variables.
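The evenly spaced points can be enumerated directly from their barycentric coordinates. The sketch below is an illustration rather than the author's construction: it lists all tuples $(k_1/l, \ldots, k_{n+1}/l)$ by a stars-and-bars enumeration and maps them to Cartesian coordinates for a given simplex. The function names and the sample triangle are assumptions made for the example. The count it produces is the binomial coefficient $\binom{n+l}{n}$, which is the dimension of the space of polynomials of degree at most $l$ in $n$ variables.

```python
from itertools import combinations
from math import comb


def evenly_spaced_barycentric(n, l):
    """All (k_1/l, ..., k_{n+1}/l) with nonnegative integers k_i summing to l (l >= 1)."""
    points = []
    # Stars and bars: place n bars among l + n slots; gaps between bars give the k_i.
    for bars in combinations(range(l + n), n):
        ks = []
        prev = -1
        for b in bars:
            ks.append(b - prev - 1)
            prev = b
        ks.append(l + n - 1 - prev)
        points.append(tuple(k / l for k in ks))
    return points


def to_cartesian(bary, vertices):
    """x = sum_i c_i v_i for barycentric coordinates c and simplex vertices v."""
    dim = len(vertices[0])
    return tuple(sum(c * v[d] for c, v in zip(bary, vertices)) for d in range(dim))


# Illustrative 2-simplex (a triangle) and degree l = 2.
triangle = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
bary_points = evenly_spaced_barycentric(2, 2)
centers = [to_cartesian(c, triangle) for c in bary_points]
```

For the triangle above with $l = 2$ this yields the three vertices and the three edge midpoints.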

We can base our criteria for choosing $\beta$ on a crucial theorem which we cite directly from [5] with
a slight modification.

Theorem 2.7 Let $h(x) := e^{-\beta|x|^2}$ be the gaussian function in $R^n$. For any positive number $b_0$,
there are positive constants $\delta_0$, $c_1$, $c_2$, and $c_3$ independent of $n$, for which the following is true: If
$f \in C_{h,m}$, the native space induced by $h$, and $s$ is the $h$ spline that interpolates $f$ on a subset $X$ of
$R^n$, then

$$|f(x) - s(x)| \leq c_1 \sqrt{\delta}\,(c_2 \delta)^{c_3/\delta} \cdot \|f\|_h \qquad (8)$$

for all $x$ in a subset $\Omega$ of $R^n$, and $0 < \delta \leq \delta_0$, where $\Omega$ satisfies the property that for any $x$ in $\Omega$ and
any number kj- < r < bo, there is an n simplex Q with diameter diamQ = r, x € Q C fl, such that 
for any integer I with % < I < 2^2. j there is on Q an evenly spaced set of centers from X of degree 
I — I. (In fact, the set X can be chosen to consist of these evenly spaced centers in Q only.) Here 
\\f\\h is the h-norm of f in the native space. The numbers 6q,Ci,C2, and C3 are given by 



(5 :=min|fo , pi.^r.^ } where p 3 = l2i-y/e/3 ; 
where A" is defined by 



I A" • -JL_ ■ 1 for odd n, 
1 A " ' 7157 ' 757 for even n > 

'■■>■■ /'.;-:i :i -2-/ )j ; , 

4 ' 



C3 



J2 + \ ■ 7r 4 • (na n ) 2 • 2 4 ■ p 4 J" or odd n, 

n-l , . 1 n+1 n-2 „ 



A" := < V e _ .where the number a„ denotes 



7r 4 . (na n )2 -24 ■ p 4 Jor euen n, with p ■■ 



the volume of the unit ball in $R^n$. In particular, if the point $x$ in $\Omega$ is fixed, the only requirement for
$\Omega$ is the existence of an n simplex $Q$, with $\mathrm{diam}\,Q = r$, $x \in Q \subseteq \Omega$, satisfying the afore-mentioned
property of evenly spaced centers, and the centers $x_1, \ldots, x_N$ in $s(x)$, as defined in (2), are just the
evenly spaced points in $Q$.

Remark: This seemingly complicated theorem is in fact not difficult to understand. The number
$\delta$ is in spirit equivalent to the well-known fill distance. The error bound (8) tends to zero rapidly
as $\delta$ tends to zero. In practical applications only finitely many interpolation points will be involved.
Hence only a finite number of simplices will appear, even if $\Omega$ is unbounded. Note that among $c_1$, $c_2$
and $c_3$, only $c_2$ depends on the shape parameter $\beta$. It is tempting to think that the influence of $\beta$ on
(8) happens only in $c_2$. This is wrong: as in (5), $\|f\|_h$ depends on $\beta$ also.
By transforming the norms we obtain the following useful results.

Corollary 2.8 Suppose $f \in B_\sigma$, $\sigma > 0$. Then (8) can be transformed into

\f(x) - s(x)\ < Cl V5(c 2 6)^ • (2tt)-" • ew . \\f\\ L , {Rn) . (9) 
Corollary 2.9 Suppose $f \in G_\sigma$, $\sigma > 0$. Then (8) can be transformed into

if 2 3 1 1/2 

- s{x)\ < Cl VS(c 2 S)^j^ I sup e^r-^ J \\f\\ G . (10) 



3 Criteria of Choosing $\beta$: the Scattered Type

In (6) the parts influenced by (3 are (C5)% and . By the definition of C and c, one can easily 
find that the function 

MN((3) := (3^ ■ eW 

describes the dependence of the error bound (6) on $\beta$. Let's call this function the MN function
and its graph the MN curve. Obviously, finding the optimal $\beta$ is equivalent to finding the value of $\beta$
minimizing $MN(\beta)$.

Similarly, in (7) the MN function is 



MN(13) := (3^ I sup^ijneT ^ 



1/2 



Another important thing which remains to be investigated is $\delta$. Our fundamental theory begins
at Theorem 2.2 where $0 < \delta \leq \delta_0$ is a requirement. By the definition of $\delta_0$, we find

S < S iff S < S n and S < 



iff S<S n and (3 2 < 



(33ev / 2p^e 2n T") 4 ^7„ 
1 



(33ey2^e 2 "T") 4 6g7 n (5 

iff S < S n and (3 < — ^ . 

ihpne^+^y/^VS 

This severely restricts the ranges of $\delta$ and $\beta$. However, the upper bound of $\beta$ can be made arbitrarily
large because $\delta$ can be arbitrarily small, theoretically.
We summarize these results in the following two criteria.



Casel. / e B a Let a > and / € B a . Under the conditions of Theorem2.2, for any fixed 
b > and 5 < 8q, the optimal value of (3 in the interval (0, /3 ] where /3 = — 3 — 3 is 

the number minimizing 



MiV(/3) := /3^e^. 



Case2. 



/ G G CT Let (7 > and / G Go-. Under the conditions of Theorem2.2, for any fixed bo > 

1 is the 



and <5 < <5 , the optimal value of /3 in the interval (0,/3q] where /3 = 



number minimizing 



33 2pne 4 ""'«+ 2 bJ v ^Tv / 5 



MiV(/3) := 




Remark: (a) In both cases $MN(\beta) \to \infty$ both as $\beta \to 0^+$ and $\beta \to \infty$. (b) The number $\beta$ minimizing
$MN(\beta)$ can be obtained by Mathematica or Matlab.
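The minimization mentioned in part (b) of the remark can also be sketched in a few lines of code. The example below does not use the paper's MN functions. It uses a hypothetical stand-in of the form $\beta^{a} e^{\sigma^2/(b\beta)}$ with illustrative constants `a` and `b`, chosen only because it blows up at both ends of the $\beta$ range in the way the remark describes. Since such functions take extremely small or large values, the sketch minimizes the logarithm instead, searching over $t = \log\beta$ with a golden-section search restricted to an interval $(0, \beta_{\max}]$. For the stand-in the unconstrained minimizer is $\sigma^2/(ab)$, which gives a check on the search.

```python
import math


def log_mn(beta, a, sigma, b):
    """log of the illustrative stand-in beta**a * exp(sigma**2 / (b * beta))."""
    return a * math.log(beta) + sigma ** 2 / (b * beta)


def golden_section_min(func, lo, hi, tol=1e-12, max_iter=500):
    """Minimize a unimodal function on [lo, hi] by golden-section search."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    x1 = hi - inv_phi * (hi - lo)
    x2 = lo + inv_phi * (hi - lo)
    f1, f2 = func(x1), func(x2)
    for _ in range(max_iter):
        if hi - lo <= tol * (abs(lo) + abs(hi)):
            break
        if f1 < f2:
            # Minimum lies in [lo, x2]; reuse x1 as the new right probe.
            hi, x2, f2 = x2, x1, f1
            x1 = hi - inv_phi * (hi - lo)
            f1 = func(x1)
        else:
            # Minimum lies in [x1, hi]; reuse x2 as the new left probe.
            lo, x1, f1 = x1, x2, f2
            x2 = lo + inv_phi * (hi - lo)
            f2 = func(x2)
    return 0.5 * (lo + hi)


def optimal_beta(a, sigma, b, beta_max, beta_min=1e-12):
    """Minimizer of the stand-in on [beta_min, beta_max], searched over t = log(beta)."""
    t = golden_section_min(
        lambda t: log_mn(math.exp(t), a, sigma, b),
        math.log(beta_min),
        math.log(beta_max),
    )
    return math.exp(t)


# Illustrative constants only.
interior = optimal_beta(a=100.0, sigma=1.0, b=8.0, beta_max=1.0)
clipped = optimal_beta(a=100.0, sigma=1.0, b=8.0, beta_max=1e-4)
```

When the unconstrained minimizer lies above `beta_max`, the search settles at the right end of the interval, which mirrors the role of the upper bound on $\beta$ in the criteria.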

Numerical Results: We now present some pictures of the MN function. The lowest point of the MN
curve corresponds to the optimal choice of the shape parameter $\beta$. In order to make the graph look
better, we sometimes multiply the function by a constant $k$ and call it the modified MN function.
Its graph will then be called the modified MN curve.



Figure 1: Modified MN curve $k\,MN(\beta)$ with $\delta = 0.01$; $f \in B_\sigma$, $n = 1$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.00004845 and 0.00004848.

Figure 2: MN curve with $\delta = 0.001$; $f \in B_\sigma$, $n = 1$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.0001510 and 0.0001530.

Figure 3: Modified MN curve with $\delta = 0.0001$; $f \in B_\sigma$, $n = 1$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.000094 and 0.000106.

Figure 4: Modified MN curve with $\delta = 0.00001$; $f \in B_\sigma$, $n = 1$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between $9.8 \times 10^{-6}$ and $1.02 \times 10^{-5}$.

Figure 5: Modified MN curve with $\delta = 0.000001$; $f \in B_\sigma$, $n = 1$, $\sigma = 1$, $b_0 = 1$.

Figure 6: Modified MN curve with $\delta = 10^{-5}$; $f \in G_\sigma$, $n = 1$, $\sigma = 1$, $b_0 = 1$.

Figure 7: Modified MN curve with $\delta = 5 \times 10^{-6}$; $f \in G_\sigma$, $n = 1$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.00216795 and 0.00216805.

Figure 8: Modified MN curve with $\delta = 10^{-6}$; $f \in G_\sigma$, $n = 1$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.003025 and 0.003040.

Figure 9: Modified MN curve with $\delta = 5 \times 10^{-7}$; $f \in G_\sigma$, $n = 1$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.002400 and 0.002410.

Figure 10: Modified MN curve with $\delta = 10^{-7}$; $f \in G_\sigma$, $n = 1$, $\sigma = 1$, $b_0 = 1$.



Although we didn't show the whole curves of the MN or modified MN functions, the crucial parts 
were presented. 

Remark: In our criteria both $\delta$ and $\beta$ have an upper bound which is quite restrictive, especially for
high dimensions. This seems to be an important topic and deserves future research.



4 Criteria of Choosing $\beta$: the Evenly Spaced Type

Formulas (9) and (10) provide us with a very good theoretical ground for choosing $\beta$.
In (9), if we extract the parts influenced by $\beta$, we will get a function of $\beta$, i.e.

As before, let's call it the MN function and denote it by $MN(\beta)$. Thus

MN{(3) := . 

Its graph will be called the MN curve. Finding the optimal $\beta$ is then equivalent to finding the
number minimizing $MN(\beta)$. Similarly, in (10), the MN function is



MN(j3) := \ sup ' - 



1/2 



Note that in Theorem 2.7 we require that $\delta \leq \delta_0$, where $\delta_0$ depends strongly on $\beta$ and should be treated
in a rigorous way.

By the definition of $\delta_0$,

S < S iff 5 < bo and 8 < 



12e 2 /3 2 27 


•2^3 


1 




" 12e 2 27- 




1 





iff S < bo and (3 < 

lUy/2eb^VS 

These results can be summarized in the following criteria. 



/ € B a Let a > and / € B a . Under the conditions of Theorem2.7, for any fixed 



Casel. 

5, < 5 < bo, the optimal choice of (3 in the interval (0,/?o] where (3 — — s — is the number 

144V2e6 2 VS 

minimizing 

MN{(3) := p b $eW. 

Numerical Results: 



Note that in this case the optimal choice of $\beta$ is independent of the dimension. Therefore in our
numerical examples we will totally ignore the influence of $n$.
Figure 11: MN curve with $\delta = 0.1$; $f \in B_\sigma$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.0045 and 0.0055.

Figure 12: Modified MN curve $k\,MN(\beta)$ with $\delta = 0.01$; $f \in B_\sigma$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.0025 and 0.0035.

Figure 13: Modified MN curve with $\delta = 0.001$; $f \in B_\sigma$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.00024 and 0.00030.

Figure 14: Modified MN curve with $\delta = 0.0001$; $f \in B_\sigma$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.0000245 and 0.000026.

Figure 15: Modified MN curve with $\delta = 0.00001$; $f \in B_\sigma$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between $2.49 \times 10^{-6}$ and $2.52 \times 10^{-6}$.


Now we begin the second case. 



Case2. / G G a Let <r > and / G G a . Under the conditions of Theorem2.7, for any fixed 
5, < 5 < &o, the optimal choice of (3 in the interval (0,/?o] where /?o = — is the number 

minimizing 

bo ( |e|2 |5|3 ^/ 2 

MN{j5) := ^ sup ~ \ 

Numerical Results: 

Here, also, the optimal choice of $\beta$ is independent of $n$.
Figure 16: Modified MN curve $k\,MN(\beta)$ with $\delta = 0.1$; $f \in G_\sigma$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.00568 and 0.00571.

Figure 17: MN curve with $\delta = 0.01$; $f \in G_\sigma$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.0172 and 0.0180.

Figure 18: Modified MN curve with $\delta = 0.001$; $f \in G_\sigma$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.0185 and 0.0200.

Figure 19: Modified MN curve with $\delta = 0.0001$; $f \in G_\sigma$, $\sigma = 1$, $b_0 = 1$.

Figure 20: Modified MN curve with $\delta = 0.00001$; $f \in G_\sigma$, $\sigma = 1$, $b_0 = 1$. Plotted for $\beta$ between 0.00405 and 0.00420.

As in Section 3, although we didn't present the entire MN curves because of the restrictions of
Mathematica, the crucial parts were presented.

Remark: (a)In our criteria there is an upper bound — 3 — for j3. This is a drawback of our 

144V2eb| \/5 

theory and deserves future research. (b) Theorem 2.7 requires that the interpolation happen in an
n-simplex whose centers (interpolation points) are evenly spaced points. It means that the data points
are not purely scattered. This is also a drawback. However, the shape of the simplex, and hence
the distribution of the centers, is very flexible, making this drawback harmless. (c) In Section 4 the
criteria do not depend on the dimension. This is the main advantage over the criteria of Section 3.